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DETAILED ACTION 

This office action corresponds to application 10/729,889 filed 12/5/2003. 

Terminal Disclaimer 

The terminal disclaimer filed on 11/19/2006 is objected to, as copending application 
10/729,883 has not been identified on the submitted terminal disclaimer. Specifically, 
application 10/729,888 has been recorded twice. Application 10/729,883 was included in the 
nonstatutory obviousness-type double patenting rejection of the office action of 5/22/2006. 
Correction is kindly requested. 

Response to Amendment 

The Examiner acknowledges and has entered amendments made to the present 
application. Claims 2, 3, 19, and 20 have been cancelled while claims 33-34 have been newly 
added. Accordingly, claims 1,4-18 and 21-34 have been newly added. 

The Examiner would like to have Applicant acknowledge the following informalities 
found in the amended claims: 

On page 4 of the claims, claim 6 has been repeated twice. On claim 17 on page 6, the 
limitation "using linguistic information to extract" should be underlined as to indicate this is a 
new claim limitation not present in prior versions of the claims. Corrections are respectfully 
requested. 
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In light of these minor informalities, the Examiner has examined the pending claims to 
further expedite prosecution. 

Claim Rejections - 35 USC § 102 
The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign countr>' or in public use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 

Claims 1, 4-18 and 21-34 are rejected under 35 U.S.C. 102(b) as being anticipated by 
Gaizauskas et al. "Information Extraction: Beyond Document Retrieval" August 1998. 
('Gaizauskas' hereinafter). In the following passages and figures, Gaizauskas teaches: 

With respect to claim 1, a computer program product located to one or more storage 
media devices usable to perform integration of mixed format data, said computer program 
product comprising instructions executable by a computer to perform the functions of: 

accessing a feed of data records (page 18; retrieving documents from collections, page 
48; newsfeeds), said data records including both structured data and unstructured data ( step b) of 
figure 1 on page 20); 

the unstructured data of a particular data record including free text related to the 
structured data of that data record (figure 1, step b) ); 

extracting relational facts from the free text (page 27, paragraph 3 and page 29), said 
extracting step being performed using linguistic information from the free text (begirming of 
page 19, figure 3, first fiill paragraph of page 47; linguistic analysis/theory); 
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, producing a set of construed data from said unstructured data, each construed datum 
containing at least one relational fact (step d) of figure 1, page 21, at least steps 4-7 on page 34, 
and processing stages on page 36 and figure 3 on page 39 with description), 

each construed datum being relatable to the structured data of the data record in which 
said free text was found (steps b and e of figure 1); and 

integrating the construed data with the particular structured data to which the construed 
data relates (page 39 section 3.2.3 and second full paragraph of page 44). 

With respect to claim 4, a computer program product according to claim 1, further 
comprising the step of applying caseframes while performing said extracting step (last paragraph 
of page 22, first paragraph of page 23, and figure 3 on page 39). 

With respect to claim 5, a computer program product according to claim 1, wherein said 
instructions are further executable to perform the function of producing a new database 
containing the integrated data produced by said integrating (page 1, and page 50 section 5.1.3). 

With respect to claim 6, a computer program product according to claim 1 , wherein said 
data feed is a database, and wherein said instructions are further executable to perform the 
function of inserting the construed data into said database while performing said integrating step 
(introduction, first paragraph). 
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With respect to claim 7, a computer program product according to claim 1, wherein said 
instructions are further executable to perform the function of creating a new database containing 
the construed data (figure 1 on page 20 and section 5.1.3.)- 

With respect to claim 8, a computer program product according to claim 7, wherein said 
new database is a relational database which relates said relational facts to said structured data 
(page 52; conventional database and section 5.1.3.). 

With respect to claim 9, a computer program product according to claim 8, wherein the 
instructions are further executable to produce a file containing the integrated data produced by 
said integrating (number 2 on page 29 and figure 2 on page 35). 

With respect to claim 10, a computer program product according to claim 9, wherein the 
instructions are further executable to produce a file having a format selected from the group of 
XML, character separated values, spreadsheet formats and file-based database structures (figure 
1 on page 20 and number 2 on page 29). 

With respect to claim 11, a computer system including a computer program product 
according to claim 1, further comprising: a processing unit coupled to said one or more storage 
media devices, said processing unit being capable of executing said instructions; and an 
execution command unit, whereby operation of said instructions and said processing unit may be 
commanded or controlled (page 46, first fiill paragraph). 
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With respect to claim 12, a computer program product according to claim 1, wherein said 
instructions are further executable to combine like attributes for the extracted relational fact 
types produced in performing said extracting relational facts from the free text (figures. 4-5 and 
accompanying descriptions). 

With respect to claim 13, a computer program product according to claim 1, wherein said 
instructions are further executable to combine like relational fact types for the extracted 
relational facts produced in performing said extracting relational facts from the free text (first 
paragraph of 5.1.2). 

With respect to claim 14, a computer program product according to claim 1, wherein said 
instructions provide relationships with domain roles applied in performing said extracting 
relational facts from the free text (page 22, last paragraph). 

With respect to claim 15, a computer program product according to claim 1, wherein said 
instructions store the relational facts produced in performing said extracting relational facts from 
the free text (introduction, first paragraph). 

With respect to claim 16, a computer program product according to claim 1, wherein the 
extracted relational facts produced in performing said extracting relational facts and the 
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integrated data produced by the performance of said integrating the produced data includes 
reference information to the original free text (figure 3 and accompanying description). 

With respect to, claim 17, a computer program product located to one or more storage 
media devices usable to perform integration of mixed format data, said computer program 
product comprising instructions executable by a computer to perform the functions of: 

accessing a database containing data records, at least some of the data records containing 
both structured and unstructured data, the unstructured data including free text (page 18; 
retrieving documents from collections), 

using linguistic information to extract relational facts from the free text (beginning of 
page 19, first full paragraph of page 47; linguistic analysis/theory); 

producing a set of construed data reflecting at least one relational fact conveyed in said 
free text, each construed datum containing at least one relational fact, each construed datum 
being further relatable to the structured data in the data record from which said free text was read 
(step d) of figure 1, page 21, at least steps 4-7 on page 34, and processing stages on page 36 and 
figure 3 on page 39); 

integrating said construed data with the structured data of the data record to which said 
. construed data relates, said integrating step retaining reference information to the original free 
text (page 39 section 3.2.3 and second full paragraph of page 44); and 

constructing a library containing extracted attributes (figures 2 and 4). 
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With respect to claim 18, a method for integrating mixed format data, comprising the 
steps of: 

accessing a database containing data records, at least some of the data records containing 
both structured and unstructured data, the unstructured data including free text (page 18; 
retrieving documents from collections); 

producing a set of construed data reflecting at least one relational fact conveyed in free 
text, each construed datum containing at least one relational fact, each construed datum being 
further relatable to a data tuple of the structured data (step d) of figure 1, page 21, at least steps 
4-7 on page 34, and processing stages on page 36 and figure 3 on page 39); and 

integrating the produced data the structured data (page 39 section 3.2.3 and second full 
paragraph of page 44). 

With respect to claim 21, a method according to claim 18, flirther comprising the step of 
applying caseframes to. said free text (last paragraph of page 22, first paragraph of page 23, and 
figure 3 on page 39). 

With respect to claim 22, a method according to claim 18, further comprising the step of 
producing a new database containing the integrated data produced by said integrating step (page 
1, and page 50 section 5.1.3). 

With respect to claim 23, a method according to claim 18, further comprising the step of 
inserting the produced data into said database (introduction, first paragraph). 



Application/Control Number: 10/729,889 
Art Unit: 2167 



Page 9 



With respect to claim 24, a method according to claim 18, further comprising the step of 
creating a new database (figure 1 on page 20 and section 5.1.3.). 

With respect to claim 25. A method according to claim 24, wherein the new database is a 
relational database (page 52; conventional database and section 5.1.3.). 

With respect to claim 26, a method according to claim 24, wherein new database includes 
at least one file containing the integrated data produced by said integrating step (number 2 on 
. page 29 and figure 2 on page 35). 

With respect to claim 27, a method according to claim 26, wherein the new database has 
a format selected from the group of XML, character separated values, spreadsheet formats and 
file-based database structures (figure 1 on page 20 and number 2 on page 29). 

With respect to claim 28, a method according to claim 18, further comprising the step of 
combining like attributes for the extracted relational facts produced in performing said extracting 
relational facts from the free text (figures 4-5 and accompanying descriptions). 

With respect to claim 29, a method according to claim 18, further comprising the step of 
combining like relation types for the extracted relational facts produced in performing said 
extracting relational facts from the free text (introduction, first paragraph). 
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With respect to claim 30, a method according to claim 18, wherein domain roles are 
applied in said step of extracting relational facts from the free text (introduction, first paragraph). 

With respect to claim 31, a method according to claim 18, further comprising the step of 
storing the relational facts produced in performing said extracting relational facts from the free 
text (page 22, last paragraph)^ 

With respect to claim 32, a method according to claim 18, wherein the extracted 
relational facts produced in performing said extracting relational facts and the integrated data 
produced by the performance of said integrating the produced data includes reference 
information to the original free text (figure 3 and accompanying description). . 

With respect to claim 33, a computer program product according to claim 1, wherein said 
instructions are further executable to replace like or related attributes for relational facts with a 
common canonical representation based on those like or related attributes (first paragraph of 
page 19, first paragraph of page 27, and page 30). 

With respect to claim 34, a computer program product according to claim 1, wherein said 
instructions are further executable to replace like or related relation fact types with a common 
canonical representation. 
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Response to Arguments 
Applicant's arguments with respect to claim 1 have been considered but are moot in view 
of the new ground(s) of rejection. 

The Applicant argues on pages 12-13 that the Gaizauskas reference teaches accessing 
ONLY unstructured data as opposed to the presently claimed data records containing BOTH 
structured and unstructured ' data. The Examiner respectfully disagrees for specifically the 
following reason: 

As presented in at least figure 1 on page 20, Gaizauskas teaches in step a) a query for IR 
(information retrieval). That is, a query for retrieving relevant documents from a collection of 
documents. As an example result, step b) teaches a retrieved text. From the retrieved text, it can 
be seen that structured text denoted' by tags <DOCNO>, <HL>, <DD>, and <S0> and 
unstructured text in between tags <TXT> and </TXT> are both contained within the same data 
record (in this case, the retrieved text of step b)). Therefore the Examiner respectfully submits 
that Gaizauskas teaches said data records including both structured and unstructured data. 

The Applicant further argues on page 12 that the cited Gaizauskas reference does not 
teach producing relational facts from unstructured data and then relate them to structured data 
found in the same data record as the unstructured data. The Examiner respectfully disagrees 
because Gaizauskas teaches this limitation in respect to claims 1,17 and 18 above. Gaizauskas 
teaches at least with reference to figure 3 of deriving structural relations in a sentence. 
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Lastly, the Applicant argues that Gaizauskas does not disclose using linguistic 
information. The Examiner respectfully disagrees as this limitation is taught by Gaizauskas (see 
rejection of claims 1, 17 and 18 above). Therein Gaizauskas uses linguistic theory to analyze 
text. Figure 3 further represents use of linguistics (i.e. analyzing text). 
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Conclusion 

Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. - Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1,1 36(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Robert M. Timblin whose telephone number is 571-272-5627. 
The examiner can normally be reached on M-F 8:00-4:30. 

If attempts to reach the examiner by telephone are unsuccessfiil, the examiner's 
supervisor, John R. Cottingham can be reached on 571-272-7079. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 



Application/Control Number: 10/729,889 



Page 14 



Art Unit: 2167 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more informatiori about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



Robert M. Timblin 
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